<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
"http://www.w3.org/TR/html4/strict.dtd">

<html>
  <head>
    <meta http-equiv="Content-Type" content='text/html; charset=utf-8' />
    <link rev="made" href="mailto:clsung@FreeBSD.org">
    <title>YASA: Yet Another Suffix Array</title>
    <link type="text/css" rel="stylesheet" href="/css/default.css">
<script type="text/javascript">
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
</script>
<script type="text/javascript">
var pageTracker = _gat._getTracker("UA-106713-2");
pageTracker._trackPageview();
</script>
  </head>

  <body>
    <h1>YASA: Yet Another Suffix Array</h1>

    <h2>Introduction</h2>

      <p><b>YASA</b> is a simple implementation of 
      <a href="http://sary.sourceforge.net/docs/suffix-array.html">Suffix Array</a>
      for text/string processing.

    <h2>Table of contents</h2>
    <ul>
      <li><a href="#features">Features</a></li>
      <li><a href="#license">License</a></li>
      <li><a href="#downloads">Download</a></li>
      <li><a href="#news">News</a></li>
      <li><a href="#links">Links</a></li>
      <li><a href="demo">Demo</a></li>
    </ul>

    <h2><a name="features">Features</a></h2>

    <ul>
     <li>Written in C</li>
     <li>Be capable to process UTF-8 strings / ASCII sentences</li>
    </ul>

    <h2><a name="license">License</a></h2>

    <ul>
	<li>YASA is release under 
	<strong><a href="http://www.opensource.org/licenses/cddl1.php">Common Development and Distribution License (CDDL)</a></strong>
	</li>
    </ul>

    <h2><a name="downloads">Download</a></h2>
    <ul>
       <ul>
        <li><a href="downloads/yasa-0.2.3.tar.gz">YASA 0.2.3</a>
       </ul>
    </ul>

    <h2><a name="news">News</a></h2>
    <ul>
     <strong>2007-10-23</strong>: YASA 0.2.3<br>
       <ul>
	<li>change flag in mmap_open(), now it works on Linux<br />
	    thanks to Pi-Chuan Chang</li>
       </ul>
     <strong>2007-10-22</strong>: YASA 0.2.2<br>
       <ul>
	<li>better logging (Debug)</li>
	<li>add unit test (CUnit)</li>
       </ul>
     <strong>2007-06-08</strong>: YASA 0.2.1<br>
       <ul>
	<li>isspace set to ismyalpha (only check ' ')</li>
	<li>for non-UTF8 string, word_ptr increase via word_ptr++</li>
	<li>Update usage</li>
       </ul>
     <strong>2007-04-12</strong>: YASA 0.02<br>
       <ul>
	<li>CodeJam, now addes shared library</li>
	<ul>
	    <li>libyasa.so.0.2</li>
	</ul>
	<li>also create test/, for TAP (Test Anything Protocol)</li>
	<ul>
            <li>I love unit test :)</li>
            <li>still premature</li>
	</ul>
       </ul>
     <strong>2006-10-28</strong>: YASA 0.01 Released<br>
       <ul>
        <li>Official Name "YASA: Yet Another Suffix Array" release</li>
       </ul>
    </ul>

<h2><a name="links">References</a></h2>
    <ul>
	<li>Bill Frakes. <a href="http://portal.acm.org/citation.cfm?id=129687.129692">
	    New Indices for Text: PAT trees and PAT arrays</a>, Information Retrieval: Structures and Algorithms, 1992.
	</li>
	<li>L. F. Chien. <a href="http://portal.acm.org/citation.cfm?id=258534">
	    PAT-Tree-Based Keyword Extraction for Chinese Information Retrieval</a>, Proceedings of the ACM SIGIR International Conference on Information Retrieval, 1997.
	<li>Ted Pedersen.  <a href="http://www.d.umn.edu/~tpederse/nsp.html">
	    N-gram Statistics Package</a>, 2003.
    </ul>
    <hr>

    <address>
      clsung@FreeBSD.org
    </address>
  </body>
</html>

